Search CORE

7,759 research outputs found

XQ2P: Efficient XQuery P2P Time Series Processing

Author: Butnaru Bogdan
Gardarin Georges
Nguyen Benjamin
Yeh Laurent
Publication venue
Publication date: 20/10/2009
Field of study

In this demonstration, we propose a model for the management of XML time series (TS), using the new XQuery 1.1 window operator. We argue that centralized computation is slow, and demonstrate XQ2P, our prototype of efficient XQuery P2P TS computation in the context of financial analysis of large data sets (>1M values)

arXiv.org e-Print Archive

HAL UVSQ

XML content warehousing: Improving sociological studies of mailing lists and web data

Author: Colazzo Dario
Dudouet François-Xavier
Manolescu Ioana
Nguyen Benjamin
Senellart Pierre
Vion Antoine
Publication venue
Publication date: 01/01/2011
Field of study

In this paper, we present the guidelines for an XML-based approach for the sociological study of Web data such as the analysis of mailing lists or databases available online. The use of an XML warehouse is a flexible solution for storing and processing this kind of data. We propose an implemented solution and show possible applications with our case study of profiles of experts involved in W3C standard-setting activity. We illustrate the sociological use of semi-structured databases by presenting our XML Schema for mailing-list warehousing. An XML Schema allows many adjunctions or crossings of data sources, without modifying existing data sets, while allowing possible structural evolution. We also show that the existence of hidden data implies increased complexity for traditional SQL users. XML content warehousing allows altogether exhaustive warehousing and recursive queries through contents, with far less dependence on the initial storage. We finally present the possibility of exporting the data stored in the warehouse to commonly-used advanced software devoted to sociological analysis

arXiv.org e-Print Archive

HAL-CentraleSupelec

Base de publications de l'université Paris-Dauphine

HAL AMU

INRIA a CCSD electronic archive server

The WebStand Project

Author: Colazzo Dario
Dudouet François-Xavier
Manolescu Ioana
Nguyen Benjamin
Senellart Pierre
Vion Antoine
Publication venue
Publication date: 01/01/2009
Field of study

In this paper we present the state of advancement of the French ANR WebStand project. The objective of this project is to construct a customizable XML based warehouse platform to acquire, transform, analyze, store, query and export data from the web, in particular mailing lists, with the final intension of using this data to perform sociological studies focused on social groups of World Wide Web, with a specific emphasis on the temporal aspects of this data. We are currently using this system to analyze the standardization process of the W3C, through its social network of standard setters

arXiv.org e-Print Archive

HAL-CentraleSupelec

Base de publications de l'université Paris-Dauphine

CiteSeerX

HAL AMU

INRIA a CCSD electronic archive server

The supply chain for electric car batteries is changing the world's geopolitics

Author: Elliott Robert
Harper Gavin
Jones Benjamin
Nguyen Viet Nguyen-Tien
Publication venue: The London School of Economics and Political Science
Publication date: 27/06/2023
Field of study

The rising demand for electric vehicles is changing the geopolitical landscape, as the world pivots away from fossil fuels towards the materials critical to the EV supply chain. As manufacturers and countries race to secure the supply of raw materials for EV batteries, new opportunities and geopolitical risks are emerging. Benjamin Jones, Viet Nguyen-Tien, Robert Elliott and Gavin Harper write about the implications of the race for battery-critical resources

LSE Research Online

Distributed Transition Systems with Tags for Privacy Analysis

Author: Anantharaman Siva
Frittella Sabine
Nguyen Benjamin
Publication venue
Publication date: 06/04/2022
Field of study

We present a logical framework that formally models how a given private information P stored on a given database D, can get captured progressively, by an agent/adversary querying the database repeatedly.Named DLTTS (Distributed Labeled Tagged Transition System), the frame-work borrows ideas from several domains: Probabilistic Automata of Segala, Probabilistic Concurrent Systems, and Probabilistic labelled transition systems. To every node on a DLTTS is attached a tag that represents the 'current' knowledge of the adversary, acquired from the responses of the answering mechanism of the DBMS to his/her queries, at the nodes traversed earlier, along any given run; this knowledge is completed at the same node, with further relational deductions, possibly in combination with 'public' information from other databases given in advance. A 'blackbox' mechanism is also part of a DLTTS, and it is meant as an oracle; its role is to tell if the private information has been deduced by the adversary at the current node, and if so terminate the run. An additional special feature is that the blackbox also gives information on how 'close',or how 'far', the knowledge of the adversary is, from the private information P , at the current node. A metric is defined for that purpose, on the set of all 'type compatible' tuples from the given database, the data themselves being typed with the headers of the base. Despite the transition systems flavor of our framework, this metric is not 'behavioral' in the sense presented in some other works. It is exclusively database oriented,and allows to define new notions of adjacency and of -indistinguishabilty between databases, more generally than those usually based on the Hamming metric (and a restricted notion of adjacency). Examples are given all along to illustrate how our framework works. Keywords:Database, Privacy, Transition System, Probability, Distribution

arXiv.org e-Print Archive